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Abstract 


In this report, we consider the performance of trellis coded multilevel 
differential phase-shift-keying (MDPSK) over Rician and Rayleigh fading 
channels. For operation at L-Band, this signaling technique leads to a more 
robust system than the coherent system with dual pilot tone calibration 
previously proposed for UHF. The results are obtained using a combination of 
analysis and simulation. The analysis shows that the design criterion for 
trellis codes to be operated on fading channels with interleaving/ 
deinterleaving is no longer free Euclidean distance. The correct design 
criterion for optimizing bit error probability of trellis coded MDPSK over 
fading channels will be presented along with examples illustrating its 
application. 
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I. 


Introduction 


In a previous report [1], the authors considered the performance of 
trellis coded multilevel phase-shift-keying (MPSK) when transmitted over a 
fading mobile satellite channel. Such a channel is characterized by Rician 
fading which can be described by a single parameter K corresponding to the 
ratio of power in the total coherent (direct plus specular) component to that 
in the noncoherent (diffuse) component. The analysis was carried out under 
the assumption of ideal coherent detection, i.e., the effect of the fading on 
the phase of the received signal was fully compensated for at the receiver. 
Thus, the numerical results derived from this analysis reflected only the 
degradation due to the effect of the fading on the amplitude of the received 
signal . 

Although most of the discussion and numerical results presented in [1] 
pertained to this idealistic assumption, mention was made of the fact that, in 
an actual implementation, the carrier demodulation function would typically be 
performed using pilot tone calibration techniques [2, 3]. Assuming a dual 
tone calibration technique [4] wherein two tones of equal power are inserted 
symmetrically at the edges of the data spectrum for the purpose of deriving a 
coherent demodulation reference, simulation results presented in [1] showed 
that the noisy carrier reference so produced resulted in a signal-to-noise 
ratio degradation of about 2 to 3 dB depending on the value of the Rician 
parameter K. This rather large degradation is caused by the fact that the 
bandwidth of the pilot tone bandpass filters has to be chosen wide enough to 
accommodate the maximum doppler (on the order of 100 Hz at UHF for a land 
mobile vehicle travelling at 80 mph) and that some of the total transmitted 
power has to be allocated to the two pilot tones thus robbing the data-bearing 
signal . 
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Under these circumstances, it is reasonable to consider the performance 
of trellis coded multilevel differential phase-shift-keying (MDPSK) with the 
hope that the performance penalty associated with the differential detector 
will not exceed that due to the noisy carrier demodulation reference in the 
coherent system. If this is indeed true (as we will be shown in this report), 
then the MDPSK system has a decided implementation advantage over the coherent 
MPSK one in that a means for extracting a carrier demodulation reference does 
not have to be provided. The results will be obtained using a combination of 
analysis and simulation. Also, as in [1] we shall consider only the case 
where interleaving/deinterleaving is employed to combat the fading. This 
allows for considerable simplification of the analysis and is of great 
practical interest. 

II . System Model 

Figure 1 is a simplified block diagram of the system under investigation. 
Input bits representing data or digitally encoded speech and occurring at a 
rate are passed through a rate n/(n + 1) trellis encoder producing an 
encoded symbol stream at a rate [(n + l)/n]R^. The encoder output symbols 
are then block interleaved to break up burst errors caused by amplitude fades 
of duration greater than one symbol time. 

While in practice the depth of interleaving is finite and chosen in 
relation to the maximum fade duration anticipated which in turn is related to 
the maximum doppler frequency produced by the mobile's movement, we shall make 
the usual assumption of infinite interleaving depth. Our simulation results 
will, however, reflect a finite interleaving depth necessitated by practical 
limitations on the total coding/decoding delay in a speech transmission system 


2 


[1]. Thus, these results will be slightly pessimistic when compared with 
those derived from theory. The primary purpose of the analysis is to indicate 
through simple example the trend of the performance behavior as a function of 
the various system parameters leaving the actual numerical performance to be 
predicted by the software simulations. 

The interleaved trellis-encoded symbols are next mapped, in groups of 
n + 1, into an M = 2 n + 1 - level PSK signal set according to the set 
partitioning method [5] . The MPSK symbols are then differentially encoded and 
modulated onto an RF carrier for transmission over the channel. Since the 
MDPSK symbol rate is R^/n, it is reasonable, from a conservation of 
bandwidth standpoint, to compare the performance of this system to an uncoded 
M = 2 n - level DPSK system with the identical input bit rate. 

At the receiver, the faded, noise-corrupted signal is differentially 
detected and the resulting symbols are then deinterleaved. The trellis 
decoder is implemented as a Viterbi algorithm with a metric depending upon 
whether or not channel state information (CSI) is provided [6]. A measure of 
CSI can be obtained from envelope detection of the received signal. 

In selecting a decoding metric, a tradeoff exists between simplicity of 
implementation and the optimality associated with the degree to which the 
metric matches the differential detector output statistics. For the case of 
uncoded MDPSK, a metric based on minimizing the distance between the received 
and transmitted signal vectors (equivalent to assuming Gaussian statistics as 
in the coherent MPSK analysis of [1]) is optimum in the sense of a minimum 
probability of error test. For the coded case, the optimum (maximum- 
likelihood) metric would depend on the joint two-dimensional (amplitude and 
phase) statistics of a sequence of receptions. 
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In this report, we shall consider both the maximum-1 ikelihood (truly 
matched) and the much simpler to implement (and analyze) Gaussian metric. 
System performance results for the latter will be obtained through a 
combination of analysis and simulation. 


III. Analysis Model 

The basic analysis model for the block diagram of Figure 1 is illustrated 
in Figure 2. The box labelled encoder is actually the combination of the 
trellis encoder and the mapping function of Figure 1. We denote a coded 
symbol sequence of length N by 

x = ( x^ , x^ ) ( 1 ) 

where the kth element of x, namely x^, represents the transmitted MPSK 

symbol in the kth transmission interval and, in general, is a nonlinear 

function of the state of the encoder s, and the n information bits u, at its 

k 

input. Before transmission over the channel, the sequence x is differentially 
encoded producing the sequence v. In phasor notation, v^ and v^ + ^ can be 
written as 




v, = / 2E e 
k s 


V ■ \ Vi = <**. e 




j<j> 


= /2E e 
s 


k+1 


where = nE^ is the energy per MDPSK symbol and 




*k 


= e 


is the phasor representation of the MPSK symbol A4>^ assigned by the 
mapper in the kth transmission interval. 


( 2 ) 


(3) 


k 


Corresponding to x, the channel outputs the sequence 

I = (y x . y 2 . — » y N ) 

where the k + 1st element y k+ ^, representing the output in the k + 1st 
transmission interval, is given by 


( 4 ) 


k+1 


* 

w, w. 


k k+1 


= \h 


* * 

JW e + N, 

S K 


j ( V A4> k+i ) 

Xk + i STeT e + N. 


k+1 


jA<j> 


k+1 


= 2E g e + noise terms 


(5) 


Here N^, N k+ ^ are samples of a stationary, complex Gaussian noise process 


N(t) which represents the additive thermal noise at the receiver front end, 
and X^, X k+ ^ are samples of a normalized, stationary, complex Gaussian 
noise process X(t) (independent of N(t)) which represents the fading 
characteristic of the mobile satellite channel. The first two moments of the 
random variables (r.v.'s) X^ and are given by 

Efxp » M i E{|XJ 2 } - | M | 2 = o 2 ; o 2 + Ivl 2 = 1 


E(N k > = 0 ; E{N k » 4 ) = 

Writing X^ in the phasor form 

j<J> k 

X k = ?k 6 

we see that p k is a normalized (unit mean-squared value) r.v. with a 
Rician probability density function (p.d.f.) given by 

2. 


(6) 


(7) 


P(P k ) = \ 


2p k (l + K) exp[- K - p k (l + K)] I Q (2p k Ml + K) ) ; p fe > 0 


0 ; otherwise 


( 8 ) 
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where once again the parameter K represents the ratio of powers of the 
coherent to noncoherent fading components. Finally, if CSI is available, then 
the corresponding side information sequence z will be denoted by 

2 = , z. , • • • , Z^) (9) 

As previously stated, we shall assume infinite depth interleaving and 
deinterleaving so that the coding channel is memoryless. Under this 
assumption, the products are independent r.v.'s and hence the 

joint channel probabilities satisfy 
N 

p n (z|x,z) = J~~ 1 p(y n l x n , z n ) (10a) 

n=l 


and 




N 


FK> 


(10b) 


IV. The Maximum-Likelihood Metric for Coded MDPSK 

For any coded communication system, the decoding process uses a metric of 
the form m(y,x;z) if side information is available and m(y,x) if it is not. 
Whatever metric is selected, it is desirable from the standpoint of 
simplifying the decoding process that it have an additive property, namely 
that the total metric for a sequence of symbols is the sum of the metrics for 
each channel input and output pair, i.e.. 


m (x*£»£) 


N 


= / m(y,x;z) 
/ j n n n 

n=l 


(ID 
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The maximum- likelihood metric 


nu 


m(jr,x) = log P N (jr|x) 


i(X,x;z) = log P N (z I x, z ) (12a) 

when side information is available or 

(12b) 

when no side information is available satisfies the requirement in (11). This 
is easily seen by substituting (10) in (11) and recalling that the logarithm 
of a product equals the sum of the logarithms. Thus, to evaluate (11) we need 
to obtain the marginal p.d.f. P(y n l x n ' z n )» take its logarithm, and sum 
the results. 


Assuming perfect side information, i.e., z^ = p^, the characteristic 

function method is used in Appendix A to derive p(y lx ,z ) with the result 

n 1 n n 


p(y|x n ,z) = / — j-t- 

L 2ir(l + N 2 R 2 ) 


exp 


2 2 1 
2p E N_R 1 
r n s 0 

2 2 

1 + V 


x J 


0 



n 


2p 2 E x 
r n s n 


2 2 

1 + NqR 



(13) 


where Jq(x) is the zero order Bessel function of the first kind. The 
logarithm of (13) gives the maximum-likelihood branch metric 
m (y n |x n ,z n ) which is quite complicated to implement. Furthermore, 
theoretical analysis of the bit error probability performance of the system in 
Figure 1 with such a metric is difficult if not impossible. 

For this reason, we turn our attention now to the much simpler Gaussian 
metric for which upper bounds on bit error performance can be readily 
computed. This approach is analogous to that taken in [7]. 
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V. Derivation of the Pair-Wise Error Probability Bound 

To find an upper bound on the average bit error probability performance 
of the system, we must first find the pair-wise error probability P(x •* x) 
which represents the probability of choosing the coded sequence x = (x^, x^, 
x^) instead of x = (x^, x^» . .., x^). Letting m(y,x;z) denote the 
coding decision metric, i.e., x is incorrectly chosen as the transmitted 
sequence when m(y,x;z) > m(y,x;z), then the pair-wise error probability has the 
Chernoff bound [6] 

P(x -» x) < E{exp(X[m(y,x| z) - m(y,x|z)] ) |x} 

= PI E { e x p (X[m(y n ,x n |z n ) - m(y n ,x n |z n )] |x} (14) 

nen 

where the "E" operator denotes statistical expectation, X is the Chernoff 
parameter to be optimized, and n is the set of all n such that 
x^ ^ x^. To simplify (4) any further, we must specify a particular 
metric and whether or not channel state information is available. As 
previously stated, we shall use the metric which is optimum (maximum- 
likelihood) for the additive Gaussian noise channel. We begin by treating the 
case where CSI is absent. 

A. No Channel State Information 

When no CSI is provided the Gaussian metric takes the form 

m(y n> x n ) = -I y n - 2E s x J 2 (15) 

o o 

Substituting (5) into (15) and noting from (3) that lx I = lx I =1 

1 n 1 1 n 1 

independent of n, the difference of the metrics required in (14) becomes 
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( 16 ) 


m ( y n ,i n ) - m(y n ,* n > = 


i * * i 2 i * .2 

- w w - 2E x + w ,w - 2E x 
' n-1 n s n 1 1 n-1 n s n 1 


= 2E [w 
s 

which can be convenient 

n,( V i n ) - m( V x n 


*w (x - X* 
n-1 n n n 


ly written in 


. *T„- 

) = 2E W (A 
s ~ n — n 


) + w ,w (x - x 

n-I n n n 

the matrix form 

- A )W 
— n — n 


)] 


where 


(17) 


W 
— n 


_ — 




*“ 

w 1 


0 x 


0 x 

n-1 

/s 

n 


n 


; a = 


; A = 



— n 

A 

’ — n 


w 


x 0 


x 0 

n 


n 


n 

— - 


— _ 




(18) 


and "T" denotes the transpose operation. 

From (5) and (7) we have 
* jA<t> 

W n-l W n = P n-l p n 2E S e ” + noise terms (19) 

Assuming that the fading is slow-varying enough that P n ^ = P n » 
substituting (17) into (14) gives 


a K 

P(x ■* x) = P(x •> x | £>_) ; 

if! Ejexp/x'w^ 1 ^ - A n )M n \x n .P n 


nen 


( 20 ) 


where the expectation is now only over the additive noise, the overbar denotes 

statistical averaging over the fading, and X’ = 2E X. 

s 

The expectation required in (20) was originally evaluated by Stein [8] 
and later by Johnston [7] in connection with the analysis of a block coded 
MDPSK system. In particular, for any nen. 
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E«fexp(X'W* T (A -A)W)|x,p\ = 
^ r -n — n — n — n 1 n n j 


expfx'^Cr 1 + 2 X'R^) hJ 

det(I + 2X'R*F ) 


expfx^F/I ■> 2XjX) Jtj 

det(I + 2X'R*F ) 

— — n~ a 


where I is the identity matrix and 


0 (x -x ) 

_ A i n n 

F = A - A = 

— n — n -n 

x -x 0 
n n 


^ = E{W n |x n ,p n ) . 


E{w , lx ,p } fp . 
1 n-1 1 n r n J . Hn-l 


E{w |x ,p } 
n 1 n r n 


h - I E «^ - U n )’'(« n -K n ) I P n .P n > 


E < I Vl - ^n-1 1 > 


! «Vl - Vl> ( "n - l‘n )> 


E(( Vl - Vl )( *« - E{ l"n - »‘n |2> 


Also, it should be noted that (17) is valid only when det(J + ZX'R^F^) > 0 


After much manipulation, (21) evaluates to 


E{exp(\'W (A ~ A )W )|x ,p } = 
— n — n — n — n 1 n n 


exp f x o r Q p n 


2X n |x - x | + Re{x (x - x ) } 

2 O'n n 1 1 n n n J 


1 - (2 V 2 |J n - xj 2 


i - (2y 2 |x, - *„l 2 


where X~ = X'N.= 2E N~X. 

0 0 s 0 

One further simplification of (23) is possible for constant envelope 
signal sets such as MDPSK. In particular, it can easily be shown that 

lx - x P = -2Re{x (x - x ) } ( 

1 n n 1 n n n 

Substituting (24) into (23) and then into (20) gives the desired result for 
the conditional (on p^) pair-wise error probability bound, namely, 


p (x x|g_) < 


2X 0 j, l £ n - *J 2a - ^Q 1 

1 - Uy 2 !^ - xj 2 

1 - (2X Q ) 2 |J n - x n | 2 


It remains to average (25) over the p.d.f. of given in (8) and then 

optimize over the Chernoff parameter. 

Since we have assumed that the fading is constant over two symbol 

2 

intervals, i.e., p ,p has been replaced by p , and that the 
n-1 n n 

interleaving/deinterleaving makes the P n 's independent, then the average 
over £ in (20) can, insofar as the upper bound is concerned, be computed as 
the product of the averages. Averaging each term in the product of (25) over 
the p.d.f. in (8) gives* 


*For simplicity of notation, we herein drop the zero subscript on X. 
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n 


exp 


2 X t^ p n l J n - *J (L " 4X) 

1 - (2X) 2 |i n - xj 2 
1 - (2X) 2 |i n - x n | 2 


1 + K 


1 + K + x - x 
1 n n 1 


2X r (1 

0 


- 4X) - (2X) (1 + K) 


x exp<-K 


2X (1 - 4X) | x - x | 2 
Nq 1 n n 1 


1 + K + x - x 
1 n n 1 


2X rp (1 - 4X) - ( 2X ) 2 ( 1 + K) 
0 


] 


(26) 


For the Rayleigh case (K = 0), (26) simplifies to 

P, 


n 


exp - 


2X p ' K - x n | 2(1 * 4X) 

1 - (2X) 2 |j n - xj 2 
1 - < 2X ) 2 |i n - x n | 2 


1 + 


x - X 
1 n n 1 


2X 5 s (1 - 4X) - (2X) 2 

. N o 


] 


(27) 


The result in (26) cannot be optimized over X independent of the 
index n. Thus, for the Rician case, we first must compute the pair-wise error 
probability (or better yet, the average bit error probability) and then 
optimize over the Chernoff parameter. On the other hand, the result in (27) 
can be optimized over X independent of n. In particular, we wish to choose 
X to maximize the term in brackets in the denominator of (27). 


12 


Differentiating this expression with respect to X and equating the result to 
zero give the optimum Chernoff parameter 


X = 2 — 

° pt 1 + 2E /N_ 
s U 

Substituting (28) in (27) and simplifying give the upper bound on pair-wise 
error probability for the Rayleigh channel: 


(28) 


P(x * x) 


*n 

neq 



where 


(29) 


v 


V N o 


1 + E /N- 


(30) 


1 . Derivation of the Bit Error Probability Bound 

An upper bound on the average bit error probability is obtained from the 
pair-wise error probability bound as 



ee a(x,x)p(x)P(x -*■ x) 
x,x z<$ 


(31) 


where a(x,x) is the number of bit errors that occur when x is transmitted and 
x is chosen by the decoder, p(x) is the a priori probability of transmitting x 
and ^ is the set of all coded sequences. An efficient procedure for 
evaluating (31) is the transfer function bound approach taken in [9], In 
particular, the trellis code is represented by a pair-state transition 
diagram [10], Each pair-state ( s k »s k ) corresponds to a pair of states s fc and 
s^ in the trellis diagram. Thus, a transition between pair states ( s k »s k ) and 
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(s k+ ^, in the transition diagram corresponds to a pair of transitions in 

the trellis diagram, i.e., s^ to s^ + ^ and s^ to s^^* Associated with each 

of these trellis diagram transitions are an MPSK symbol being outputted by the 

mapper and a corresponding sequence of n input bits (an information symbol) to 

the encoder. Thus, the transition between two pair-states in the transition 

2 

diagram is characterized by the squared Euclidean distance 6 between the 
corresponding MPSK output symbols and the Hamming distance Q between the 
corresponding input bit sequences. 

Based on the above discussion, in the absence of fading, each branch 
between pair states in the transition diagram has a gain G of the form 

.2 


G A 6 


(32) 


Here I is an index, D is the Bhattacharyya distance defined by 
nE, 


D = expl™ 


4N, 


(33) 


and the summation accounts for the possibility of parallel paths between 
states in the trellis diagram. The transfer function (the sum of all possible 
path gains) of the transition diagram is denoted by T(D,I) and, by comparison 
with (31), the upper bound on average bit error probability is given by 


p b^ii I(D ’ I) li=i 


(34) 


Since, for no fading, the factor D in (32) is analogous to the corresponding 
term in the product describing P(x -» x), for the fading case we make the same 

s 2 

association. In particular, for a given branch label gain, D would be 
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replaced by (26) or (27) as appropriate with |x - | representing the 

Euclidean distance between pair-states as discussed above. 

2. An Example 

Consider the case of rate 1/2 trellis coded asymmetric QPSK using a 
simple 2-state trellis. The appropriate set partitioning is illustrated in 
Figure 3, the trellis diagram in Figure 4, and pair-state transition diagram 
in Figure 5. The performance of this system in the absence of fading and with 
coherent detection was treated in [9] with the following results: 


4 4a 

4ac 1^4 . I^l+a 1,.1+a 

T(D, I ) = l _ 2b ; a = -D ; b = -D ; c = -D 


(35) 


Here a is the ratio of powers between the I and Q channels which is related 
to the angle 4> that defines the asymmetry (see Figure 3) by 

a = tan^ (36) 


Also, note the branch gains a, b, and c are of the form in (32). Substituting 
(35) into (34) gives 


4(l+2a) 



which when optimized over the asymmetry produces the desired upper bound 

p b < r exp(-2E b /N 0 ) (38) 

Based upon the above discussion, for differential detection in the presence of 
Rayleigh fading, the transfer function of (35) applies with, however, a, b, 
and c defined by 


15 


I f , 2v 

a = ~ v> ) 



Performing the differentiation required in (34) gives upon simplification 



For a symmetric QPSK signal set (a = 1), (40) reduces to 


( 39 ) 


(40) 


P b <2(t^f) <2-v 2 ) 

On the other hand, optimizing (40) with respect to a produces the optimum 
asymmetry condition 


(41) 


a opt 1 4 


: )V('- -is) 


(42) 


which can then be substituted into (40) to give the optimized bit error rate 
performance bound. 


Illustrated in Figure 6 is the upper bound on bit error probability 
performance for the symmetric signal set as given by (41) and the comparable 
upper bound for the optimized asymmetric signal set as determined from (40) 
together with (42). Also shown in this figure are the corresponding results 
for ideal coherent detection as obtained in [1]. Finally, the results for 
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uncoded BPSK (same bandwidth as rate 1/2 trellis coded QPSK) are superimposed 
on the same figure. Although the performance of uncoded BPSK over a Rayleigh 
channel can be obtained in exact closed form, for fairness of comparison, the 
uncoded performance curves in Figure 6 are upper bounds determined from the 
same Chernoff bound approach as was used for the coded results. For MPSK, 
these bounds are mathematically given by 


M-l 


P b± 


i - u 

. . 2 2 irm 

m=l l-o cos — 
M 


(43a) 


for differential detection and 
M-l 


Pt 1 h. ! ♦ /sin 2 f\(l s' 

' A-% 


(44a) 


for coherent detection. When M=2 (BPSK), these results simplify to 


r , 2 1 * 2E b /N 0 

P < i - u = z r 

(1 + E b /N 0 ) 2 


(43b) 


and 

P < i (44b) 

' 1 * E b /N 0 

We observe from Figure 6 the interesting result that, over the range of 
Eb/NQ considered, the differential detection results track the coherent 
detection results with a fixed E^/Ng difference of about 1.5 dB for the 
coded cases and about 3 dB for the uncoded case. 

For differential detection in the presence of Rician fading, the transfer 
function of (35) is again applied with, however, a, b, and c now defined by 
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a = 


1 V ‘‘ 


b = 


1 ^2 

2 ^2° 


c = 


1 C 3 

I V 


( 45 ) 


where 


«t- 


1 + K 




1 + K + 

1 

2X rr(l-4X) - (2X) Z (1+K) 
N 


0 

6?K (8XK1-4X) 


1 + K + 

1 

\ 9 

2X -p(l-4X) - (2X) Z (1+K) 


0 


; i = 1, 2, 3 


with 


2 2 
= 4 ; = 


4 .2 _ 4g 

l + a ; 3"l+a 


(46) 


(47) 


Once again performing the differentiation required in (34) and recalling that 
we must now also minimize over the Chernoff parameter X as well as the 
asymmetry parameter a, we get the following result: 


P, < min min 

b ~ X>0 


M3 D 




“ (l - 5/2)' 


(48) 


Figure 7 is a plot of (48) versus E^/N^ for a Rician parameter 

K = 10 (typical of the mobile satellite channel). Also shown 

are the results for the symmetric signal set, i.e., the minimization over a 

in (48) is not performed; instead a = 1 is used. Finally, the upper bound 

on the performance of uncoded BPSK in the same environment, namely. 
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( 49 ) 


P < min LD 

D X 1 

is also shown. This result is a special case (M = 2) of the more general 
result for uncoded MPSK, i.e., 

M-l ^ 

P b < min ^ 5 m D ” (50) 

A. 1 
m=l 

2 A 2 

where, rather than (47), = 4 sin irm/M; m = 1, 2,..., M is used in (46). 

As was done in Figure 6, the comparable results for coherent detection, 
as obtained from [1], are superimposed on this same figure. We observe that 
once again, over a large range of E^/N^ va ^ ues (where the curves are 
approximately straight lines), the results for differential detection track 
the coherent detection results with a fixed difference of about 1.5 dB for the 
coded cases and about 2 dB for the uncoded case. 

3 . Another Example 

As another example, consider a rate 2/3 trellis coded asymmetric 8-PSK 
modulation again using a simple 2-state trellis. The appropriate set 
partitioning is illustrated in Figure 8, the trellis diagram in Figure 9, and 
the pair-state transition diagram in Figure 10. The performance of this 
system in the absence of fading and with coherent detection was also treated 
in [9] with the following results: 



T(D , I ) = 2d + 


2 (a L + a 2 ) c 
1 - 2 b 1 


1 " 2 


2 2 

V 2 V 

D I + D I 


C “ 2 


D ° (I + 1) 


a 2 ~ 2 


b =1 


2 2 

V V 2 

D I + D I 


2 2 

V V 2 

D I + D I 




D I 


(51) 


The set of squared distances from signal point 0 to signal point j = 


2 

1,2,3,. ..,7, denoted by Sj , are given by 
5 2 = 4 sin 2 = 2 (1 - cos 4> ) ; 


6 2 = 4 sin 2 - |j= 2 (1 + cos 4>) 


2 

= 2 ; 


S 6 = 2 


S 2 * 4 sin 2 ^4 + = 2 (1 + s ^- n ♦)» $ 2 = 4 sin 2 ^ - || = 2 (1 - sin 4») 


S 4 = 4 


(52) 


Applying (34) to (51) gives the upper bound 


p ( in 1 fi + d1‘ (d 5 ' ♦ D ' 5 
d - <}> 2 U + 


2 - D u 7 


(53) 


2 2 ' 

6 7 S 3 
1 - D - D 


For differential detection in the presence of Rician fading, the transfer 

s . 2 £ • 

function of (51) still applies with, however, D 1 replaced by £.D X ; 


i - 1,2,..., 7 where and C. are defined in (46) with E, replaced by E = 2E, 

1 n #5 r 
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but the 5^'s are now given by (52). Performing the differentiation required 
in (34), and recalling that we must now also minimize over the Chernoff 
parameter X as well as the asymmetry angle 4>» we get (53) with the same 
replacement as above, namely. 


mm mm ± r 
b 1 X>0 * 2 h 


1 <4 (V 1+ h D ^)( 2 ~ h° 7 ) 

+ 


(54) 


Figure 11 is a plot of (54) versus E^/Nq for a Rician parameter K = 10. 

Also shown are the results for the symmetric signal set obtained by setting 
<j> = ir/4. Finally, the upper bound on the performance of uncoded QPSK in 
the same environment is determined from (50) with M=4. 

As before, the comparable results for coherent detection are superimposed 
on this same figure. These results are obtained from the analysis performed 
in [1] and as such are given by (54) with 5- and C- defined as follows: 


q = e 


-K 


-fif ni 


(0) exp (rij (9)) erfc n 


.(0)dej 


n i (0) = r 

VI + K 


X 6^ (E. /N n ) _ 

1 b 0_ _ yic cos 0 


2 2 

c. = - 4 x*$/ 

l l 


(55) 


with 6^ ; i = 1,2,. ..,7 given by (52). 

Although not as obvious in Figure 7, we observe that, except for the 
possibility of a proportionality constant, the bit error probability 
performances of the differentially coherent and the coherent detection schemes 
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approach each other asymptotically as E^/N^ gets sufficently large. To 

see this better, Figure 12 superimposes the symmetric results of Figures 7 and 

11 on a single grid at the same time extending them to a broader range of 

EL/N n . It is also interesting to observe from Figure 12 that the rate of 
b 0 

decrease of P. with E,/N n is much steeper for the rate 1/2 trellis 
b b U 

coded 4DPSK (or 4PSK) case than for the rate 2/3 trellis coded 8DPSK (or 8PSK) 
case . 

Similar observations can be made for the Rayleigh fading case (see Figures 13 
and 14). In fact, here it is relatively simple to analytically demonstrate 
these behaviors. For symmetric rate 1/2 trellis coded 4DPSK in a Rayleigh 
fading environment, we have already shown that Eq. (41) is an upper bound on 
its bit error probability performance. Substituting (30) in (41) and letting 
E^/Nq become large give the asymptotic upper bound 

g 

P, < r (56a) 

( W 


A similar asymptotic result for the case of coherent detection was obtained in 
[1] and is given by 


P < 9e 2 (56b) 

b ~ 16(E b /N Q )2 

For the symmetric rate 2/3 trellis coded 8DPSK, it is straightforward to show 
that both the coherent and differentially coherent bit error probabilities 

vary inversely with E^/Nq [as opposed to (E^/Nq) ]. 
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The specific results of (56) and the statement that follows can be 
generalized in terms of an upper bound on P^* for arbitrary code rate 
n/(n+l) and the number of points in the signal constellation (i.e., M) of the 

form Pjj < Ko(Eb/Ng)-L. We now investigate the interesting relationship 
between the exponent L and the distance structure of the error event paths in 
the trellis diagram. 

To begin the investigation, consider the asymptotic behavior (large 
Eb/Ng) of the pair-wise error probability for the Rayleigh channel. This 
can be obtained by substituting (30) in (29) and letting E^/Nq become 
large with the result 


P(x -» x) 


< 


z}_ 

(Es/Nq ) 11 


n 

nen 



(57) 


where i < N is the dimensionality of the set n i.e., the number of MPSK 
symbols in the length N sequence x for which ^ x^, and E s is related 
to Ejj as discussed before. Now, since, from (31), the upper bound on P^ is 
a weighted sum of pair-wise error probabilities over all error event paths in 
the trellis diagram, then as E^/Ng becomes large, the dominant term in the 
sum will be that corresponding to the smallest exponent of E^/N^, i.e., the 
length of the shortest error event path in the trellis diagram . Thus , as 
mentioned above, asymptotically as E^/Ng gets large, Pk is approximately 
upper bounded by Kq (E^/Nq) ^ where L is determined from the shortest 
error event path in the trellis diagram and is determined from the product 
of the branch distances associated with this path, as well as the free distance 
of the code. 
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This is an interesting conclusion which has the following implications. 
First of all, it points out that the design of trellis coded MPSK modulations 
to be transmitted over the Rayleigh fading channel and differentially detected 
at the receiver should be guided primarily by maximizing the length of the 
shortest error event path (resulting in the steepest descent of with 

Eb/N()) an< * t * 16 P ro< * uct t * ie branch distances on this path, with 

maximizing the squared free Euclidean distance as a secondary consideration. 

As the channel statistics tend more to a Rician channel (i.e., a strong 
line-of-sight component is present), the design of the code should be guided 
by both the length and branch distance product of the shortest error event 
path and the squared free Euclidean distance. Finally, as the fading 
disappears, resulting in an AWGN channel, the criterion for good trellis code 
design becomes once again only the squared free Euclidean distance as 
suggested by Ungerboeck [5]. 

An interesting side result stemming from the above is that, for trellis 
codes with parallel paths between states (i.e., the length of the shortest 
error event path is unity), the asymptotic performance of these codes on the 
Rayleigh channel varies inversely as E^/Ng. This result is corroborated 
by the numerical results in Figure 13 which correspond to the trellis diagram 
of Figure 9. Also, note that for the rate 1/2 trellis coded QPSK case, the 
shortest error event path has length two (see Figure 4). Thus, P^ varies 
inversely with the square of E^/N^ as demonstrated in Figure 6 and the 
analytical result in Eq. (56a). 

Finally, it can be shown, from the coherent detection results in [1], 

that the asymptotic behavior of P, with E,/N,, varies identically as the 

d b U 

above, the only difference being in the proportionality constant Kq. 

Similar asymptotic behavior can be demonstrated for Rician channels, the 


24 



primary difference being an additional attenuation factor exp(-LK). A more 
complete discussion of the design and asymptotic behavior of conventional and 
multiple trellis codes [11] on fading channels, both for coherent and 
differentially coherent detection, will appear in a future report by the 
authors [12]. 

B. Ideal Channel State Information 


Consider now the case where ideal channel state information is available 
at the receiver. This is tantamount to assuming that the receiver has 
absolute knowledge of the fading amplitude p in each symbol interval. In 
this case, the Gaussian decoding metric becomes 

m(y , x ) = - y - 2E p , p x I (5: 

J n’ n IJ n s^n-l^n n' v 

instead of (11). Following steps similar to (16) - (24), the conditional 
pair-wise error probability analogous to 25 is given by 


exp 

P(X 4 x|£) < | | 


neri 


2X ' E p h \x - x I 2 (1 - 4X'p 2 N_) 
s n ' n n 1 r n 0 

1 - (2X'p V) 2 |x - x I 2 
r n 0 ' n n 1 

1 - (2X'p 2 N_ ) 2 | x - x | 2 
n 0 1 n n* 


(59) 


Unfortunately, even for the Rayleigh case, the average of (58) over the p.d.f. 
of £ cannot be accomplished in closed form, much less the minimization over 
the Chernoff parameter or optimization over the signal set asymmetry. More 
serious than this, however, is the fact that the condition on the determinant 
of I + 2X'R *F given in the statement following Eq. (22) is not satisfied for 
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all values of p in the region (0,°°). As such, the denominator of any 
n 

term on the right hand side of (58) is not always positive and thus averaging 
(58) over a Rayleigh or Rice distribution is not valid. In view of this, the 
upper bounding approach discussed thus far must be abandoned for the ideal 
channel state information case. 

Computer simulation of the two examples in the previous section including 
ideal channel state information revealed that little was to be gained over the 
no channel state information case. The reason for this is perhaps tied to the 
suboptimality of the Gaussian metric since, in the case of coherent detection 
(where the Gaussian metric is optimum), a reasonably significant gain was 
achieved. Because of the above analytical difficulties and the potential lack 
of gain as evidenced by the computer simulations, we shall not pursue the 
channel state information case any further. 

VI . Simulation Results 

In this section, we describe and present the results of a software 
simulation of the system block diagram of Figure 1. The development of a 
simulation has a manyfold purpose. First, note that the simulation is 
indicative of the exact system performance whereas the theoretical bit error 
rate expressions are upper bounds. Second, when the number of states in the 
trellis diagram becomes large (e.g., 16), determining the state transition 
diagram and its associated transfer function is a tedious task; in such cases, 
simulation is the more expedient approach. Finally, system degradation due to 
the finite size of the inter leaver/deinterleaver and decoder buffer imposed by 
the practical constraint on the allowable total delay is analytically 
intractable. Hence, to predict true system performance corresponding to the 
real world environment, one must again turn to simulation. In the next 
paragraph, we expand upon the last of these issues. 
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The block interleaver of Figure 1 can be regarded as a buffer with d rows 


which represent the depth of interleaving and s columns which represent the 
span of interleaving . Thus, the size of the interleaver (in symbols) is 
d x s. Data is written into the buffer in successive rows and read out of the 
buffer (the order in which it is transmitted over the channel) in columns. At 
the receiver, the block deinterleaver performs the reverse operation, i.e., 
the received soft quantized symbols are written into the buffer in successive 
columns and read out in rows. In practice, the interleaving depth should be 
chosen on the order of the maximum fade depth anticipated which, for the 
fading mobile satellite channel under investigation, depends on the doppler 
frequency or, equivalently, the vehicle speed: the smaller the doppler 

frequency, the longer the fade duration and vice versa. The interleaving span 
should be chosen on the order of the decoder buffer size. When this is done, 
the performance degradation (relative to that for the analytically tractable 
assumption of infinite interleaving depth and buffer size) will be inversely 
proportional to the product of interleaving size and doppler frequency. 

On the other hand, the performance of the differential detector will 
degrade directly proportionally to doppler frequency. The reason for this is 
that the fading phase process varies more rapidly. Assuming infinite 
interleaving and decoder buffer size, one can use the same analytical approach 
as previously discussed to derive upper bounds on the bit error probability. 

Example : Here we consider the more practical case of a rate 2/3, 

16-state trellis code combined with symmetric 8PSK modulation (it was shown in 
[9] that, for this case, the additional coding gain produced by the addition 
of asymmetry to the modulation is small and thus we have chosen to ignore 
it.) Although, with much computational effort and the assumption of infinite 
interleaving and buffer size, this system can be analyzed by the approach 
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taken in the previous section, our interest here lies in computing the 
performance with limited interleaving and decoder buffer size as follows. 

At the present time, this system is a candidate for NASA's Mobile 
Satellite Experiment (MSAT-X) project at L-band whose objective is to transmit 
4800 bps of digitally encoded speech over a 5-kHz RF channel with a bit error 

_3 

rate of 10 . To satisfy the previously mentioned constraint on total 

allowable delay, the interleaving size, interleaving depth, and decoder buffer 
size have been optimized at this bit error rate to achieve the minimum bit 
signal-to-noise ratio. For the specified delay constraint (60 ms), the size 
of the block interleaver and deinterleaver have been chosen equal to 128 8PSK 
symbols (or 256 input bits). With the above chosen interleaving size, the 
interleaving depth has been optimized by computer simulation and found to be 
equal to 16 symbols. Thus, the interleaving span is 128/16 = 8 symbols over 
the range of doppler frequencies from 40 Hz to 200 Hz (vehicle speeds of 
15 mph to 75 mph at L-band). Note, however, that for MSAT-X channels 
operating at low doppler frequencies such as 40 Hz, we can indeed have fade 
durations much longer than 16 symbols. In this case, an interleaving size of 
128 symbols is not sufficient, thus a significant performance penalty can 
occur. Finally, with the above delay constraint imposed, the buffer size was 
optimized through simulation and found to be 32 symbols (or 64 bits). 

Figure 15 illustrates the results of the simulation for perfect doppler 
tracking and time synchronization, and no intersymbol interference ( IS I ) - In 
particular. Figure 15 assumes a fixed doppler frequency of 400 Hz, a fixed 
Rician parameter K = 10, and the two cases of no interleaving and limited 
interleaving as described above. Also shown are the corresponding results for 
no fading, i.e., K = °°. From these results, it is clear that a large 
performance penalty is paid when interleaving is not used. Quantitatively 
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speaking, at a bit error rate of 10 , the performance with interleaving is 

1.5 dB worse than that under ideal conditions (i.e., no fading). Without 

interleaving, one must pay an additional 2.5 dB in average bit energy-to-noise 

ratio . 
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Appendix A 


Derivation of The Maximum-Likelihood Branch 
Metric for Coded MDPSK with Perfect Side Information 


Referring to (5), we write w^ and i n rectangular complex 

notation as 

< 4 «r j *k ♦<-»! + jv 2 

» k+1 - X k* L /S s e J( *l' +i *k*l)* Nt+1 , Ul + ju 2 

where U^, are independent Gaussian random variables as are V^, 
Furthermore and are independent of and V 2< The detector output. 

S t jlf 

y^ + ^, in the k+1 transmission interval is given by the product of w^ 
and which, in view of (A-l), can be written as 


y A w w =(UV - UV) + j(UV + U V ) 
k+1 = k k+1 11 22 12 21 


= Y t + jY 2 

The joint characteristic function of Yj_ and Y 2 is 

. , , J ( “l Y l™2 I 2 ) | 

Y («, .»,) = E 
1* z 1 *■ 


(A-l) 


(A- 2 ) 


■ ®s 


E u{ e 


j[(u 1 V 1 -Kd 2 V 2 )U 1 +(to 2 V 1 -u 1 V 2 )U 2 ] 


(A-3a) 


V^j 


which because of the independence between and U 2 becomes 

j(u 1 V 1 + « 2 V 2 )U i| 


$ Y 1 ,Y 2 ( "l’“2 ) = E V 




V V 2 


I j(» 2 V -» V )U 

) 

* Eu je 

V l’ V 2j 

a 


(A- 3b) 
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Since for a Gaussian random variable X with mean X and variance a^, we have 


jXw - ^o 2 co 2 

$ x (u) = } = e X 


(A-4) 


then applying (A-4) to each expectation in the product inside the braces of 
(A-3b) gives upon simplification 




j ( “l V l +tt 2 V 2 )U l 


V, ,v„ 

E 7I 

1 2 

U 2 


j(a) 2 V 1 -« 1 V 2 )U 2 j 


V V 2 


= e 


j (to 1 U 1 -wo 2 U 2 )V 1 -o 2 (to 2 -Ho 2 )V 2 / 2 


x e 


j(a) 2 U 1 -o) 1 U 2 )V 2 -a 2 (coJ-Ko 2 )V 2 /2 


(A-5) 


where a denotes the variance of and U 2 (or and V 2 ), 


Finally, using the relation 


jctX-a 2 BX 2 /21 1_ 


,j e jctX-a BX /2j = 


A + <TB 


exp 


2iaX - g 2 (BX 2 + a 2 ) 


2(1 + o B) 


(A-6) 


to evalute the expectations over and V 2 required in (A-3b), we get after 
much simplification 


$ (co, ,<o„) = 
1*2 


Y Y \"i j'-n/ - 4 2 2 

» Y< > 1 1 1 + <7 ((^ + C0 2 ) 


exp 


2jf 1 (u 1 ,w 2 ) - o f 2 (to 1 ,to 2 )| 

r i 4,2 2v ■, 

2 [ 1 + a (o^ + co 2 )] 


(A-7a) 


where 


f 1 (« 1 ,» 2 ) = (U 1 V l - u 2 v 2 ) 


M 1 + 


( U 1 V 2 + U 2 V 1 ) “2 


(A- 7b) 


f 2 (to 1 ,w 2 ) = (co 2 + o> 2 )(uJ + ^ + V x 2 + V 2 ) 

The next step is to perform the inverse Fourier transform of (A-7a) 
thereby obtaining the joint probability density function of Y^ and Y 2 . In 
particular. 
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, N iff "j (<0 l y l +<J 2 y 2 ) 

Pv Y ' y l’ y 2^ = 9 11 ^ v v * U 9 ^ e dw 1 du„ 

V*2 L 1 (2ir r J J Y V Y z 12 12 

—00 _00 


Making the change of variables 


= R cos 0 


= R sin 9 


and performing the integrations gives after some simplification 




where 


W(R) = 


2ir(l + o R ) 


exp - 


a 2 R 2 (U 2 + U 2 + V 2 + V 2 ) 
2(1 + a 4 R 2 ) 


A(R) = U 1 V 1 - U 2 V 2 - yi (l + oV) 


B(R) = U X V 2 + U 2 V L - y 2 (l + aV) 


Finally, recognizing from (7) and (A-l) that 


U 1 V 1 ' 0 2*2 = 2E sPk + l C0S 


U 1 V 2 + Vl = 2 Vk + l Sin ^ 


-2 -2 -2 -2 2 

U 1 + U 2 + V 1 + V 2 = 4 Vk + l * 

where we have made the assumption that is constant over two symbol 

2 

intervals, i.e., P k P fc+1 = P k+1 » then (A-10) together with (A-ll) becomes 
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1 * 2 


2ir(l + a R ) 


(1 + a R ) J 


2 Vk + i cos ^k+A 2 


x M* ?1 ' i + »¥ 


* 5-2 - 


2 Vk+l Sin 

1 + o 4 R 2 


If we now evaluate the variance o from the noise statistics in (6), we find 
that a 2 = N q . Also recognizing from (3) that Re{x k+ ^} = cos an d 

Im{x k+1 ) = sin A< t> k+1 » then (A-13) can be put in the final desired form, 
namely. 


00 

p Y v <yi»y 2 ) = p(y k+il x k+i ,z k+i ) = / — 2 2. 

Y 1’ Y 2 1 2 k+1 k 1 J Q 2ir(l + NjT) 


2 2 
2p. 1 E N_R 
r k+l s 0 

x exp - 5 — y 
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J 0 R y k + l 


2p k+l E s X k+l 

1 + n 2 r 2 


which agrees with (13), 
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Figure 1. Block Diagram of the Trellis Coded MDPSK System 














Figure 2. General Memory less Channel 
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Figure 3. Set Partitioning of Asymmetric 
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Figure 5. Pair-State Transition Diagram for Trellis Diagram of 
Figure k 
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UNCODED 

BPSK 



E t/N 0 , dB 


Figure 6. Bit Error Probability Performance of Rate 1/2 Trellis Coded 
4-PSK in the Presence of Rayleigh Fading; 2 States, K = 0; 
No Channel State Information 




Ej/Nq, dB 


Figure 7. Bit Error Probability Performance of Rate 1/2 Trellis Coded 
4-PSK in the Presence of Rician Fading; 2 States, K = 10; 

No Channel State Information 
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RATE 2/3 TRELLIS CODED SYMMETRIC 8PSK 
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Figure 11. A Comparison of the Bit Error Probability Performances of Rate 1/2 Trellis Coded 
Symmetric 4-PSK and Rate 2/3 Trellis Coded Symmetric 8-PSK in Rician Fading; K = 




Figure 12. Bit Error Probability Performance of Rate 2/3 Trellis Coded 
8-PSK in the Presence of Rician Fading; 2 State, K = 10; No 
Channel State Information 
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E b /N 0 , dB 

Figure 13. Bit Error Probability Performance of Rate 2/3 Trellis Coded 
8-PSK in the Presence of Rayleigh Fading; 2 State, K = 10; 
No Channel State Information 




Figure 14. A Comparison of the Bit Error Probability Performances of Rate 1/2 Trellis Coded Symmetric 
4-PSK and Rate 2/3 Trellis Coded Symmetric 8-PSK in Rayleigh Fading; K = 0 
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Figure 15. Simulation Results for Bit Error Rate Performance of Rate 2/3, 16 State Trellis 
Coded 8-DPSK 



